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Abstract — We derive a lower bound on the capacity pre-log 
of a temporally correlated Rayleigh block-fading multiple-input 
multiple-output (MIMO) channel with T transmit antennas and 
R receive antennas in the noncoherent setting (no a priori 
channel knowledge at the transmitter and the receiver). In 
this model, the fading process changes independently across 
blocks of length L and is temporally correlated within each 
block for each transmit-receive antenna pair, with a given rank 
Q of the corresponding correlation matrix. Our result implies 
that for almost all choices of the coloring matrix that models 
the temporal correlation, the pre-log can be lower-bounded by 
r(l - 1/L) for T < (i - 1)/Q provided that R is sufficiently 
large. The widely used constant block-fading model is equivalent 
to the temporally correlated block-fading model with Q — I 
for the special case when the temporal correlation for each 
transmit-receive antenna pair is the same, which is unlikely to 
be observed in practice. For the constant block-fading model, 
the capacity pre-log is given by T(l T/L), which is smaller 
than our lower bound for the case Q — I. Thus, our result 
suggests that the assumptions underlying the constant block- 
fading model lead to a pessimistic result for the capacity pre-log. 

I. Introduction 

We analyze the capacity of a Rayleigh block-fading mul- 
tiple-input multiple-output (MIMO) channel in the nonco- 
herent setting where the transmitter and the receiver are 
aware of the channel statistics but have no a priori channel 
state information. In this setting, the penalty on capacity' 
incurred by allocating resources to channel estimation is 
automatically accounted for We model channel variations 
in time by the temporally correlated block-fading model 
introduced in [I]. According to this model, the fading process 
takes on independent realizations across blocks of length 
L; however, for each transmit-receive antenna pair, it is 
correlated within each block with a given rank Q of the 
corresponding L x L correlation matrix. 

The capacity of the temporally correlated block-fading 
channel is not known even in the single-input single-output 

This work was supported by the WWTF under grant ICT 10-066 
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'In this paper, the term capacity refers to capacity in the noncoherent 
setting. 



(SISO) case. The capacity pre-log, which is defined as the 
ratio of the capacity to the logarithm of the signal-to-noise 
ratio (SNR) as the SNR goes to infinity, has been character- 
ized in [I] for the SISO case and in [2]-[4] for the single- 
input multiple-output (SIMO) case. For regular stationary 
fading processes, the capacity of the MIMO channel has been 
studied in [5]. It was proved that, in this case, the capacity 
grows only doubly-logarithmically due to the regular fading 
assumption. For nonregular stationary fading processes, the 
MIMO capacity pre-log is not known to date. 

In this paper, we derive a lower bound on the capacity pre- 
log of a rank-Q temporally correlated block-fading MIMO 
channel with block length L, T transmit antennas, and R 
receive antennas. We show that the pre-log is lower-bounded 
by T(l - 1/L) for T < {L - 1)/Q provided that R > 
T{L — 1)/{L — TQ). This lower bound can be achieved for 
almost all (a. a.) choices — i.e., up to a set of measure zero — 
of the coloring matrix that models the temporal correlation 
for the transmit-receive antenna pairs. 

Our result is particularly surprising when compared to the 
capacity for the constant block-fading model as derived by 
Zheng and Tse [6]. The constant block-fading model is a 
special case of the temporally correlated block-fading model 
for Q — 1 that is obtained when the correlation matrices for 
all transmit-receive antenna pairs are assumed to be equal, 
which is unlikely to be observed in practice. Zheng and Tse 
showed that the pre-log for the constant block-fading model 
is M*(l - M*/L) with M* = min{r,i?, [L/2\}, which 
is less than or equal to L/A. In the temporally correlated 
block-fading model for (5 = 1, our lower bound on the pre- 
log is L - 2 + 1/L if T = L - 1 and i? = {L - If for 
a.a. coloring matrices.- This shows that a much higher pre- 
log can be achieved and, hence, the results predicted by the 
constant block-fading model are pessimistic. 

Apart from our main result, the methods employed in its 
proof may be of independent interest. We use a generalized 

-Note that the coloring matrix corresponding to the constant block-fading 
model belongs to the set of measure zero where this bound does not hold. 



change-of-variables theorem for integrals in combination 
with Bezout's theorem [7, Proposition B.2.7] to estabHsh 
certain transformation properties of differential entropy under 
finite-to-one mappings. Furthermore, we use an important 
property of subharmonic functions to lower-bound the in- 
tegral of a certain real analytic function. In the SIMO case, 
a similar problem was recently solved using an algebraic- 
geometry method [3]. Our alternative method works in a 
more general setting and, thus, may also be useful for bound- 
ing differential entropy terms appearing in other problems . 

The rest of this paper is organized as follows. The system 
model is presented in Section II. The lower bound on the 
capacity pre-log is stated and discussed in Section III. A 
proof of the lower bound is provided in Sections IV and V 
and in three appendices. 

Notation: Sets are denoted by calligraphic letters (e.g., 
I), and \I\ denotes the cardinality of I. Sets of sets are 
denoted by fraktur letters (e.g., 971). We use the notation 
[M : N] = {M, M+l,...,N} for M, N eK Boldface 
uppercase (lowercase) letters denote matrices (vectors). Sans 
serif letters denote random quantities, e.g., A is a random 
matrix and x is a random vector The superscripts ^ and 
^ stand for transposition and Hermitian transposition, re- 
spectively. The all-zero matrix or vector of appropriate size 
is written as 0, and the M x M identity matrix as Im- 
For a matrix A <E C^^^^, the element in the ith row and 
jth column is denoted by aij. We denote by , where 
I C [1 : M] and J C [1 : N], the \I\ x \ J\ submatrix 
of A containing the elements Uij with i E I and j E J\ 



furthermore, [A]^- = and \A\^ = {A^y^.j^^y 

denote by e C'-^' the subvector of x containing the 
elements Xi with i E I. The diagonal matrix with the 
elements of x in its main diagonal is denoted by diag(a;). 
We define diag(Ai, . . . , Ak) as the block diagonal matrix 
with the matrices Ai, . . . , Ak on the main block diagonal. 
The modulus of the determinant of a square matrix A is 
denoted by \A\. For a; S M, \x\ — maxjrn G Z | m < a;} 
and \x\ = min{m E Z \ m > x}. We write E[-] for the 
expectation operator, and CAfdj,, S) for the distribution of 
a jointly proper Gaussian random vector with mean /j, and 
covariance matrix S. The Jacobian matrix of a differentiable 
function is denoted by J^. 

II. System Model 

We consider a MIMO channel with T transmit and B 
receive antennas. The fading process associated with each 
transmit-receive antenna pair conforms to the temporally 
correlated block-fading model [1], which results in the fol- 
lowing channel input-output relations within a given block 
of length L: 
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We 



diag(hr,t) Xt + Hr 



e[l:R]. (1) 



E 

te[l:T] 

Here, Xt G is the signal vector transmitted by the 
tth transmit antenna; y,, e is the vector received by 
the rth receive antenna; h^,* ^ CAf{0, Zr,tZ^t)' where 



Zr,t e with Q = Tank{Zr,tZ^t), is the vector of 

channel coefficients between the tth transmit antenna and the 
rth receive antenna; ~ CN{Q,Il) is the noise vector at 
the 7'th receive antenna; and p G K+ is the SNR. The vectors 
hr,t and are assumed to be mutually independent and 
independent across r G and t G [1:7], and to change 

in an independent fashion from block to block ("block- 
memoryless" assumption). The transmitted signal vectors Xt 
are assumed to be independent of the vectors h,,t and n^. 
We note that the channel coefficient vectors can be written 
as 



Zr,tSr,t , 



with the Q-dimensional whitened vectors Srt ^ CAf{0, Iq). 
Setting y 4 (yT . . . ^ yT)T ^ ^^.l and n ^ (rT . . . , nT)T 

G C^^, the R input-output relations (1) can be written more 
compactly as 



with 




^tZl,: 



XtZ 



St e 



(2) 



(3) 



R,t/ 



where we have defined Xt = diag(xt) G 



■^LxL 



and St = 



(si.-- 



, J"^ G C^*^. For later use, we also define x 



s 4 



(Si, 



z ^ 



T ..,s?)TgC™'3, and 

ZlT 



G 



^RLxTQ 



\Zr,i ■ ■ ■ Zb„T^ 

We will refer to Z as the coloring matrix. 

III. A Lower Bound on the Capacity Pre-log 

Because of the block-memoryless assumption, the coding 
theorem in [8, Section 7.3] implies that the capacity of the 
channel (2) is given by 



1 



C{p) = -sup/(x;y) , 



(4) 



Here, /(x;y) denotes mutual information [9, p. 251] and the 
supremum is taken over all input distributions / on C^^ that 
satisfy the average power constraint 

E[||xf ] < TL. 

The capacity pre-log is then defined as 

C{p) 



X = lim 



p^oa log(p) 



(5) 



We will obtain the main result of this paper, which is stated 
in Theorem 1 below, by maximizing with respect to T the 
lower bound on x given in the following proposition. 

Proposition 1: For T<i?, there exists a set ZCC^-^^^'^ 
with a complement of Lebesgue measure zero such that for 
each coloring matrix Z E Z, the capacity pre-log of the 
channel (1) satisfies 



X > Xiow(r) ^ min^Tl 1-^ ),i?( 1-^ 



L 



L 



(6) 



Proof: See Section IV. ■ 
The main result of this paper is stated in the following 
theorem. 

Theorem 1: There exists a set Z CC^^^^'^ with a com- 
plement of Lebesgue measure zero such that for each coloring 
matrix Z ^Z, the capacity pre-log of the channel (1) satisfies 

> * A ,r( ifT<ropt 

X > Xiow = < V LJ (7) 



if T > Topt , 



where 

rj = max 



(8) 



and 



To 



RL 



pt 



L + RQ^l 



< mm{R, L/Q} . (9) 



Proof: We obtain a lower bound on the pre-log for T trans- 
mit antennas by maximizing xiow with respect to the number 
of effectively used transmit antennas (note that we can always 
switch off some antennas). Thus, we will take the maximum 
of Xiow(r*) in (6) with respect to T* < min{T, i?}. Here, 
T* is also restricted hy T* < R because Proposition 1 holds 
only for T* < R. Because x\ow{T*) is the minimum of two 
quantities where the first is monotonically increasing in T* 
and the second is monotonically decreasing in T*, it attains 
its maximum at the intersection point Topt defined in (9). If 
T < Topt, we have xiow{T) = T(l - 1 /L), which proves the 
first case in (7). For T > Topt, we have to take into account 
that Topt might not be a natural number Thus, we have to 
take the maximum of xiow ( [T'opt J ) and xiow( [T'optl which 
turns out to be 77 in (8). This shows the second case in (7) 
and concludes the proof. ■ 

Remark 1: The set Z will be specified in Definition 1 in 
Section V. 

Remark 2: For a fixed R, the maximum value of xiow 
in (7) is obtained by using either [ToptJ or [Topt] transmit 
antennas. This implies that the optimal number of transmit 
antennas is upper-bounded by [Topt] ■ 

Remark 3: For L = Q, x*ovj equal to zero and hence 
trivial. 

Remark 4: The lower bound Xiow ™ '-^'^ expressed 

as f / 1 \ 

Xiow = min|rn -- j,?? 

Remarks: X\ov, can be at most \{L -1) /Q\{1 - 1/ L). 
This value of Xiow attained for T = [{L-l)/Q\ and 
R=\{L-lf/Ql 

Remark 6: By (9), the condition T < Topt in (7) is 
equivalent to i? > T{L -I) / {L - TQ). Thus, for a fixed 
T <L/Q, we can always obtain Xiow^ T{l — 1/L) by using 
a sufficiently large R. 

Remark 7: If all matrices Zr,t for r G [1 : and t G [1 : T] 
coincide, the temporally correlated block-fading model for 



Q = 1 reduces to the constant block-fading model studied 
in [6]. The pre-log in the constant block-fading model is 
M*{l-M*/L), where M* = min{T,i?, lL/2\y, therefore, 
it is upper-bounded by i/4. On the other hand. Theorem 1 
for (5 = 1 implies that the pre-log for the correlated block- 
fading model is lower-bounded by (cf. (6)) 



X > min <j T ( 1 - 



(10) 



for a. a. coloring matrices Z eZ. In particular, for T = L — 1 
and R — [L — 1)^, the lower bound in (10) becomes 
i — 2 + 1/L. Thus, for a. a. choices of coloring matrices, 
the pre-log is much higher than in the constant block-fading 
model. ^ Hence, the results predicted by the constant block- 
fading model are pessimistic. 

IV. Proof of Proposition 1 

For L < TQ, the inequality in (6) is trivially true, 
because in this case xiow < 0. Therefore, it remains to 
prove (6) for L > TQ, which will thus be assumed in 
the following. By (4), the capacity can be lower-bounded 
as C{p) > (l/L)/(x;y) with the specific input distribution 
X ^ CAf{0, Itl)- Inserting this lower bound into (5) then 
gives 

1 /(x;y) 



X > — lim / N ■ 
L P^OO log(yo) 



(11) 



In what follows, we thus assume that x ~' CJ\f{0, Itl)- 

We have /(x;y) = h{y) — ^(y|x) with h denoting 
differential entropy. Hence, we can lower-bound /(x;y) by 
upper-bounding h{y\x) and lower-bounding h{y). Similar to 
[2, Eq. (8)], we have 



h{y\x) < TQR\og{p) + 0{1) 



(12) 



where "+ 0(1)" means "up to a function of p that is bounded 
for p — > 00." Furthermore, similar to [2, Eq. (12)], we have 



h{y) > I \Tr\]\og{p) + h{Py) 

\re\l:R] / 



(13) 



re[l:R] 

where y was defined in (3), 



P ^ diag([Ii]^^,...,[Ji]^J GC^-[-«ll^'-l"^^ (14) 

the Ir C [1:L] for ?- G [1 : R] are certain subsets that will be 
specified later, and c is a finite constant. Note that in (13), 
h{Py) and c do not depend on p. Using (12) and (13) in 
/(x;y) = h{y) — /i(y|x), we obtain 

/(x;y)> [ \Ir\-TQR\\og{p) + h{Py) + 0{l) 

\re[l:_Rl / n< 



(15) 



The proposed lower bound on the pre-log in (6) is established 
by inserting (15) into (11) and choosing the sets {2, }re[i:_R] 

^This implies that the coloiing matrices coiTesponding to the constant 
block-fading model belong to the complement of Z (which has Lebesgue 
measure zero and is unlikely to be observed in practice). 



such that 

\Ir\ = min{ri -T + TQR, RL} , (16) 

re[l:R] 

provided that h{Py) > — cxd. It remains to show that there 
exist sets {Ir}re[i:R] satisfying (16) and a set Z C C^lxtq 
with a complement of Lebesgue measure zero for which 
h{Py) > — oo for each Z ^ Z. This will be done in the 
next section. 

V. Proof that h{Py) > ~oo 

Let us split the vector x into the vectors x-p = ([xi]p^ , . . . , 
[xt]p^) and = ([xi]!^^, . . . , [xt]p^) , where T't C [1 : 
L] and Vt = [l:L]\Pt for t e [1:T]. Because h{Py\x-p) < 
h{Py), it is sufficient to show that h{Py\x-p) > — oo. As 
in [3], we wish to relate h{Py\x-p) to the simpler quantity 
/i(s,x-p) = h{s) + h{xx))- This will be done via the family 
of xp -parametrized mappings 



: {s,xv) >-> Py, 
where y is defined in (3), i.e., 

y 



J2 Stst, 

te[l:T] 



with 



-BLxRQ 



(17) 



(18) 



(19) 



According to (18) and (19), the components of each vector- 
valued mapping (fy^^ are multivariate polynomials of degree 
2. The Jacobian matrix J^^^ of each mapping 0^;^ is equal 
to 



/ 



^1,1 • • • AlJ 



ll • • • 



Ar^I ■ ■ ■ Ar^t J 

-E,.eii:„i|2:.|x(rQfl. + Ete[i:Tilftl) 



where 



(20) 



[ diag (aW , . . . , a^;^^)] i G [1 : T] , r € [1 : i?] 



with ai'l ^ [Zr,t]{i]Sr,t, le[l:L\ 



Note that by (14), J^^J^s, xx>) can be written as 



(21) 



where 




(22) 



Based on the family of mappings (p^^ in (17), the relation 
between h{Py \ x-p) and h{s, xp) can be established by using 
the definition of conditional differential entropy [9, Chapter 
8] and by applying the change-of-variables theorem for 
integrals under finite-to-one mappings'* [10, Theorem 3.2.5]. 
For this, we need to show that the family of mappings (j)^^ 
is finite-to-one almost everywhere (a.e.) for a.a. choices of 
x-p. We now define the set Z for which this proof works. 

Definition 1: Let Z C C^lxtq jj^g matrices Z 

such that the following holds: There exist a choice of sets 
{^r}re[i:R] Satisfying (16), i.e., 

\Ir\ = miii{TL -T + TQR, RL} , (23) 

r<£[l:R] 

and a choice of sets {Vt}teli:T] satisfying 

Y l^tl = mSix{T,TQR-{R-T)L}, (24) 

te[l:T] 

such that J^^^{s, xj)) is nonsingular a.e. for a.a. choices of 
x-p. 

We will show presently that the set Z is nonempty. In fact, 
it covers a.a. of {£,RLxtq^ 

Condition (23) on {|Ir|}re[i:i?] and condition (24) on 
{|^t|}te[i:T] guarantee that the matrix J0^^(s,a;u) is square. 
More specifically, we have with (20) that 



#rows 



rG[l:-R] 



min{rL -T + TQR, RL} , (25) 



where (23) was used, and 
#columns = TQR+ ^ 1^*1 (^6) 

te[l:T] 

= TQR + TL- E \'Pt\ 

te[l:T] 

= TQR + TL- max{T, TQR - (R- T)L} 
= mm{TQR + TL-T, RL} , (27) 

where (24) was used. Thus, comparing (25) and (27), we 
have #rows = #columns. 

The next lemma states that Z satisfies one of the claims 
made in Proposition 1. 

Lemma 1: The complement of the set Z has Lebesgue 
measure zero. 

Proof: See Appendix A. ■ 
In the remainder of our proof that h{Py) > — oo, we 
consider an arbitrary Z eZ. To use the change-of-variables 
theorem, we will invoke Bezout's theorem to show that the 
mappings <f)x^ are finite-to-one a.e. 

Lemma 2: Let be defined as the set of all (s, xx>) such 
that JcI,^^{s,xt>) is nonsingular. Then for all y G (l)xj,{M-), 



[XtZR^]j-^^ 



"^For a finite-to-one mapping, the inverse image of each point in the 
codomain is a set of finite cardinality. 



we have 

I0.i({y})n>(| 



< A 2(Ete[i:TiP*l + ^Qfl) . (28) 



Proof: Let y e <i)xT,{M). Then according to (17)-(19), 
the zeros of the vector-valued mapping 

{s,x-p) ^ (l)x^{s,x-o) - y 

are the common zeros of X]te[i T] 1-^*1 + TQR polynomials 
of degree 2. Thus, by a weak version of Bezout's theorem [7, 
Proposition B.2.7], the number of isolated zeros (i.e., with no 
other zeros in some neighborhood) cannot exceed rh. Since 
Jcf,^^ is nonsingular on A4, the function (f>xj, restricted to A4 
is locally one-to-one and, hence, (p^-p — V has only isolated 
zeros on M. Therefore, the number of points (s, xv) £ 
such that (■s, xv) = y cannot exceed fh. ■ 
Next, we will establish a transformation property of dif- 
ferential entropy under finite-to-one mappings in a general 
setting. More specifically, we will obtain a lower bound on 
differential entropy using the change-of-variables theorem for 
finite-to-one mappings [10, Theorem 3.2.5] in combination 
with the uniform bound in Lemma 2. 

Lemma 3: Let u G C" be a random vector with contin- 
uous density function /„. Furthermore, let k: C" C" be 
a continuously differentiable mapping with Jacobian matrix 
and let M ^ {u e C : |Jk(m)| ^ 0} and v = k(u). 
Assume that the complement of A4 has Lebesgue measure 
zero and \ti~^{{v}) fl M\ < m < oo for all G C", with 
some constant to G N. Then there exists a set C C" such 
that 

/l(v) > — mlog(TO)— TO / /u(tt) log(/u(tt)) (ilt 

Ju 

+ m f /u(tt)log(|J«(M)ndw. 



Proof: See Appendix B. 



To lower-bound h{Py 
ferential entropies h{Py 



\-p), we first lower-bound the dif- 
x-p^x-p). By Lemma 2, we have 
n A^l < TO. Furthermore, since we assume Z G 
Z, we have by Definition 1 that J^^^{s, x-p) is nonsingular 
a.e. and, hence, the complement of M. has Lebesgue measure 
zero. Thus, we can invoke Lemma 3 with h{\i) = /i(Py|xp= 
x-p), K = (pxv, u = (s, xp), and to = to to obtain 

h{Py\xp^xp) > — TOlog(m) 

- m fs,XT^is,Xv)log{fs,XTyis,Xv))d{s,X-D) 

Ju 

+ m fs,XT,{s,xv)log{\J4>^Js,xv)\'^)d{s,xv). (29) 
Ju 

Using (29), we can now lower-bound h{Py\x-p) as follows: 
h{Py\x-p) = J f^^{x-p)h{Py\x-p^ x-p)dxp 
TO log(m) 



> / 



rh / /s,xi,(s,a'i?)log(/s,xi,(s,a;i?))c'(s,a;-D) 
Ju 



ml fs,„T,{s,XD)\og{\J^ (s,xd)\ )d{s,XT)) dxp. 
lu 

(30) 

The lower bound in (30) consists of three terms. The first 
term is just a finite constant. The second term is finite 
because the differential entropy of the Gaussian random 
vector {s,Xd) is finite. The last term is finite if 

/ fsAs,x)\og{\J^ {s,xv)\'^)d{s,x) (31) 

j£TL+TQR 

is finite. To show that (31) is finite, we will invoke the 
following general result for analytic functions. 

Lemma 4: Let / be an analytic function on that is not 
identically zero. Then 

h= I cxp(-||||niog(|/(0|)dC> (32) 

Proof: See Appendix C. ■ 
Since /g x is the density of a standard multivariate Gaussian 
random vector and Aet{J^^JyS,xx>)) is a complex polyno- 
mial that is not identically zero due to the definition of Z in 
Definition 1, the integral in (31) is finite by Lemma 4. Hence, 
with (30), we obtain h{Py\xp) > —oo. This concludes the 
proof that h{Py) > — oo. 

Appendix A: Proof of Lemma 1 

We can view det{J^^^{s,xx>)) as a function f{Z,x,s). 
Assume that there is a choice of sets {Ir}re[i:R] satisfying 
(23) and a choice of {Vt}t£ii:T] satisfying (24) such that 
f{Zo,Xo,So) ^ at some {Zq,Xo, Sq). Thus, because for 
fixed Xq and Sq the function f{Z,XQ,So) is a polynomial 
in the entries of Z and hence analytic in Z, there is a set 
Z CC^'"^^'^ with a complement of Lebesgue measure zero 
such that f{Z,XQ,So) ^ for all Z ^ Z. Hence, for each 
fixed Zi gZ, f{Zi, X, s) is not identically zero; furthermore, 
it is analytic in x and s. Therefore, it is nonzero for a. a. 
{x, s). We conclude that J^^^{s^ x-p) is nonsingular and thus 
Zi G Z. Definition 1 impUes that Z C Z, and hence the 
complement of Z has Lebesgue measure zero. 

It remains to find choices of {2r}re[i:i?,] ™d {'Pt}te[i:T] 
such that f{Z,x,s) 7^ at some {Z,x,s). We start 
by choosing sets {Irjre [1:_r] '^hat satisfy (23). Let k = 
min { [{TL - T) /{L - TQ)\ ,R) and i ^ TL - T ~ {L - 
TQ)\_{TL-T)/{L-TQ)l and define 




if r G [1 : fc] 

if r = fc + 1 (33) 
if r G [k + 2:R]. 



For this choice, [1 : TQ] C 1^. for all r G [1 : i?], and as many 
Xr as possible without violating (23) are equal to [1 The 
sets {Vt^t^li-.T] have to satisfy (cf. (24)) 



te[l:T] 



Vt\ = miix{T,TQR- {R-T)L} = i?/ 



(34) 



We define the sets Vt such that 1 Vi, 2 e P2, ■ ■ ■ , T e Pt, 
and further T+1 G Vi, T+2 G V2, etc., up to L G Vl mod t- 



If (34) is not yet satisfied, we look for the minimal t' such 
that \Vt' I is minimal and 1 ^ Vt' and start again with 1 G 
Vf, 2 S Vt'+i, ■ ■ ■ We proceed until (34) is satisfied. This 
construction of the sets Vt can be formulated as 



Vt ^ {ie[l:L]:3j e [1:19^] such that i = j mod L 

J -I 



and j 



lcm(r, L) 



= t mod T 



(35) 



where lcm(-, •) denotes the least common multiple. For 
example, for T = R = 3, L = 6, and Q = 1, we have 
-dR = 9 and (35) yields Vi = {1,4,3}, V2 = {2,5,1}, 
and V3 ~ {3,6,2}. Note that since the sizes of the sets Vt 
differ at most by 1, (35) together with (34) yields 



\Vt\ < 



< 



max{T, TQR - {R- T)L} 
T 

mayi{T,TQR - {R-T)TQ} 



T 



(36) 



where L > TQ has been used. Some properties of the sets 
Vt are summarized in the following lemma, whose proof is 
omitted due to space limitations. 

Lemma 5: Suppose that R>T. Let T^t e [1 : L] be defined 
according to (35) but for R—1 receive antennas (i.e., R is 
formally replaced by R—1) and set Ct — Vt\Vt- Then 

(i) Ct n = for t ^ t' 

(ii) Ct C In 

(iii) There exist pairwise disjoint sets Qt satisfying \Qt \ — Q, 
Gt^Vt^ 0, and g = Ute[i:T] = ^fl\Ut6[i:T] ^t- 



We will also make repeated use of the following result, 
which is a corollary of [11, pp. 21-22]. 

Lemma 6: Let M e C^^^ and let I,JC[1:N] 



J 



with \I\ = \J\. If [M]j^i 



if [M]'^ is nonsingular, then dct(iVf) ^ if and only if 

det([M][};;;;|\^)^o. 

Remark 8: Lemma 6 is just an abstract way to describe a 
situation where given a matrix M, one is able to make row 
and column interchanges that yield a new matrix of the form 
( 0^ c ) where A and C are square matrices. In this case, it is 
a basic result that the determinant of M equals the product 
of the determinants of A and C. 

For the choices of {Vt}te[i:T] ^nd {1r}re[i:R] described 
above, it now remains to find x, s, and Z such that 
f{Z,x,s) = dct( J0^^(s, a;x))) is nonzero. This will be 
done by an induction argument over R>T. 

Induction hypothesis: For R > T {as assumed in Propo- 
sition 1), {Vt}teii:T] as in (35), and {Ir}r£[i:R] as in (33), 
there exists a point {Z, x, s) with a; = (1, . . . , 1)^ such that 
f{Z, X, s) — det( J0^^(s, x-p)) is nonzero. 

Base case (proof for R = T): We have to show that the 
determinant of the matrix in (22) is nonzero for R = T. For 



N]\X 



or [M]^^^^i\-^= 0, and 



R^T, (34) reduces to Ete[i:T]l^tl ^^Q' ™d with (36), 
we obtain \Vt\ = TQ. Furthermore, from (33), I,. = [1 : i] 
for r e [1:2^]. We choose s,._t = for r ^ t, and we choose 
[Zr,t]'Pr such that [{Zr^i ■■■ Zr^r)]^ is nonsingular We 
have [Ar,t]vt (cf- (21), noting that Vt D Vt = 0). 
Hence, we can use Lemma 6 with M ^ dct(j0^ (s,a;x))) 
given by (22) and [M]^ diag ■ • ■ Z^.t)]^^,. ■ ■ , 

[(^Zt.i ■■■ Zt,t)]^^) ■ It thus remains to show that the 
determinant of the matrix [-^][i'j^j\x corresponding to 



(37) 



[A 



T,T\ 



is nonzero. Because of (21), this matrix is a diagonal matrix 
and can be chosen to have nonzero elements by choosing 
[Zt^tlj,^ and St,t such that [Zt^t]{i}St,t 7^ for all i e Vt. 
Thus, the matrix in (37) is a diagonal matrix with nonzero 
entries and hence its determinant is nonzero. 

Inductive step: We have to show that we can find Z^^t and 
•s/f.t for t € [1 :T] such that the determinant of the matrix 
J^^^{s, xx)) in (22) is nonzero assuming that it is nonzero 
for the R—1 setting. Let Q, Qt, and Ct be as in Lemma 5 and 
let gt eGtnVt due to Lemma 5). Set [ZRj]g\g^ = 0. 
Furthermore, let [Zji^t]Gt nonsingular for all t £ [1:2^]. 
It easily follows that ([Zij^ijg • • • [Z/j tIs) is nonsingular. 
Next, we choose s/j.t such that it is orthogonal to the rows 
of [ZR.t]g^\{g^} and satisfies [ZR^t]{gt}SR,t + 0. With (21) 
and gt G Vt, we then obtain [ylij^jjg = 0, t e [1 : T\. Hence, 
according to Lemma 6 with M given by (22) and \M.\x = 
••• \Zr_t\g), the determinant of J^^^{s,Xt)) in 
(22) is nonzero if and only if the determinant of the following 
matrix is nonzero; 



where 



[A 



i,iJ 



[^«,l]Uen,Tl-Ct 



[A 



e[i:T] 



[Zi 



'■'t — 



[ZR-l,t]x^_J 



By choosing the remaining rows of ZR,t appropriately, we 
obtain [AR^t]^y^^,^^^^^^ c,,)\c,^^ ^"^^ {[AR.t]'l\) + 0. By 
Lemma 6, it can then be easily seen that the determinant of 
Jii>^JyS, xd) in (22) is nonzero if and only if the determinant 
of (22) for i?— 1 is nonzero, which is true by the induction 
hypothesis. 

Appendix B: Proof of Lemma 3 

First, we state the version of the change-of-variables 
theorem [10, Theorem 3.2.5] that we will use. 

Lemma 7: Let '0 : C" — > C" be a differentiable mapping 
with Jacobian matrix J^. Then for any measurable, nonneg- 



ative, real-valued function g on C" and any measurable set 
S C C" , we have 



g{ip{u)) \J,p{u)\ du = g{v)'NT{ip\S,v) dv , 
s Jc"' 

where Nr(V;|5,t>) denotes the number of points it G 5 such 
that ipiu) = V. (Note, in particular, that Nr('(/;|5,t>) = if 
there is no tt G 5 such that V'(m) = v.) 

We will also make use of the following lemma to obtain 
one-to-one mappings with maximal support. 

Lemma 8: For any Lebesgue measurable set ,/l C C" and 
any mapping t/j: C" ^ C" such that \'i}}~^{{v^) n „4| < 
m < oo for all v G C", there exists a Lebesgue measurable 
set B C ^ such that is one-to-one and tpi^^) = "^i-^)- 
Furthermore, \ip^^{{v}) D (^\;B)| < m — 1 < cx) for all 

V G C". 

Proof: Let DJl denote the set of all measurable subsets 

V C A such that tfj^^ is one-to-one. We have the natural 
partial order of inclusion on 9)1. For any chain (i.e., totally 
ordered set) £ of sets in the union of all sets in £ is an 
upper bound for all sets in £ (i.e., for any Aq £ <t we have 
^0 ^ Ucee'^) belongs to DJl. Thus, by Zorn's lemma, 
there exists at least one maximal element in Let ;B be a 
maximal element in DJl. If there exists a v G 

we can add one point u G ip~'^{{v}) to B and B U {u} 
belongs to DJl with B ^ B U {u}, which is a contradiction 
to the maximality of B. Hence, i.'{13) — i^iA). Furthermore, 
since i3 G 971 the set B is measureable and is one-to-one. 
Finally, for each v G ^/'(.A), there exists a u G B such that 
iP{u) = V. Thus, \'iJj-\{v})n{A\B)\ < \ilj~\{v})nA)\- 

1 < TO-1. ■ 

For Ai and m as defined in Lemma 3, we now partition 
the set Ai into subsets Vi with i G [1 : m] such that each 
Ki = is one-to-one and C"\ IJ^gj^.^j Vi has Lebesgue 
measure zero. The existence of such sets can be shown by 
using Lemma 8 repeatedly. Next, we define the set U used 
in Lemma 3. Let 



ueM 



k{u) 



> 



Vm g K-\K{{u}))nM 



(38) 



Note that k{U) = k(A4). The set U is measurable since it is 
the preimage of {1} under the measurable function^ 



9{u) 



/„(«--^(k(m))) 
) \J^{k-\k(u)W 



where J-{u) = {i G [1 : m] : k{u) G Ki(Vi)}. By Lemma 8 
with -0 = K and A = IA, there exists a set W C W such that k 



u 



-1 



^The function g is measurable by tlie following ai'gument: la 
continuous by the inverse function theorem. Hence, for all u with equal 
Tiu). the denominator in the definition of g is just the maximum over 
a finite set of continuous functions and thus measureable. Since there are 
only a finite number of possible realizations of T(yL), we can partition the 
domain of g into a finite number of sets where g is measureable. Therefore, 
g is measureable. 



is one-to-one and = k(Z//) = k(A^). Applying Lemma 7 
with g(y) = —/„(?;) log(/„(7;)), ?/> = n, and S ~U yields 
(/„ denotes the density of v = t(u)) 



hiy) = - / Uv)\og{Uv))dv 

Uv)\og{Uv))nT{n\U,v)dv 

= [ A(Ac(M))log(/„(K(M))) \Mu)\^du. (39) 

Here, (a) holds because v is supported (up to a set of measure 
zero) on k{14); note also that Nt{k\U,v) is 1 for v = k{u) 
and else. The next step is to establish a relation between 
the densities /„(k(m)) and fu{u) for u gU. Let U'CU be 
any measurable subset of U. We have 

\J^{u)\^du = 



L{v)dv 

k(U') 

Pr{v G k{U')} 
Pr{M G h-^{k{U'))} 



L 



U{u)du. (40) 



k-1(k(W')) 



Since k,- = kL, , we have 



1(k(W')) 



and since C"\ IJjgjj^.,^] Vi has Lebesgue measure zero, the 

set Uie[i-m] '^i^i^i^')) is equal to k~^{k.(U')) up to a set 
of Lebesgue measure zero. Thus, 

Uu)du = V / Uu)du (41) 

(note that k~^{k,{U')) C Vi and the Vi are disjoint). Using 
for an arbitrary i G [1 : m] Lemma 7 with -0 = and 
S = Ki{K^^{K{W))), and using the inverse function theorem, 

luse Ki = k| 



we obtain (note that = J^^ on Vi because Ki ~ k\^) 



(42) 

Another application of Lemma 7 with = ^ — and 

iS = {ni{n'^^ {k{U' )))) then gives 



/uK-\^)) 



(it) 



We can upper-bound (43) by 



du. (43) 



/„(M)|J«(n)|2 



< 



K-i(K,(re,""(K(W')))) 



/u(m) du 



u{x) < 



ib) f 

< / fu{u)du, 
Jw 



(44) 



where in (a) we used the fact that u E U (we have u G 

oKiy-{U')) QU'<ZU<ZU) and the inequahty in 
(38), and in (6) we used Q W. Note 

that the upper bound (44) does not depend on i G [1 : m]. 
Hence, (40)-(44) yield 

/ /„(k(m)) |JK(M)pdu < ?7i / fuiu)du, 
Jw Jw 

for an arbitrary measurable set U' C Thus, 

|JK(tt)p < m/u(M) a.e.oni^. 
Inserting this into (39) leads to 

Mv)>-/^m/„(n)log(^M^)d« 



> — mlog(m) — m / /„(«) log(/u(it)) c?m 
+ m//„(«)log(|J«(«)ndM. 

Appendix C: Proof of Lemma 4 

Since / is not identically zero, there is a G such that 
/(^o) 7^ 0. Then g(^) = /(^ + ^o) is an analytic function 
that is nonzero at ^ = 0. By changing variables ^ i-^ ^ + ^o, 
we obtain for Ii in (32) 

Ii= [ cxp(-||| + ^of)log(|5(OI)c^l- 
Noting that 

u+^oW < ur+m\\Uo\\ + uor 

< lllf + 2max{||||lM||of}+ll6lP 

< 3||Cf + 3||6f , 

we can lower bound Ii by 

Ii>cf cxp(-3|||f)log(|5(0|)d^ 4 /2, (45) 
Jc" 

with c = exp(-3||^o|P)- Using the mapping Lp: M^w^C^; 
X !->■ (a;[i.jv] + ia;[Ar+i:2W]). we can write I2 in (45) as 



I2 = c exp{—3\\x\\'^) u{x) dx . 



(46) 



with = \og{\g{Lp{x))\). Since ,g(0) / 0, we have 

u{0) > —00. By [12, Example 2.6.1.3], u{x) is a subhar- 
monic function. A useful property of subharmonic functions 
is stated in the following lemma (see [12, Theorem 2.6.2.1]). 

Lemma 9: Let u be a subharmonic function on W C M^^, 
and let x G R^w^ jf ^ ^ W for some r > 0, with B^^r ^ 
{t> gM^^: ||f < r}, then 



u{y) ds{y) , 



where Sx.r — {y G I^^^ : 11?/ ^^11 = '^l' '^2N is the area 
of the unit sphere in M?^ , and ds denotes integration with 
respect to the (2Af — l)-dimensional Hausdorff measure (cf. 
[10, Subsection 2.10.2]). 

Using a well-known measure-theoretic result [10, Theo- 
rem 3.2.12], we obtain 

/ exp(— 3||a;|p) u(a;) da; 

= / / cxp(-3r^)u(y)ds(y)dr . (47) 

J (0.00) Jsn ^ 



We thus have 

(a) 



/a c / / cx.p{~3r^) u{y) ds{y) dr 

J {0,00) JSo.r 

ib) 



(&) f 

> ccr2Aru(0) / exp(— 3r^) 

J(0,oo) 



2^ ^2Ar-l 



dr 



> —00 , 



where (a) follows by using (47) in (46), (b) is due to Lemma 
9, and (c) holds because m(0) > —00. With (45), it then 
follows that Ii > —00. 
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